NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

https://doi.org/10.1609/aaai.v39i17.33987

Lee, Bruce D; Toso, Leonardo F; Zhang, Thomas T; Anderson, James; Matni, Nikolai (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Representation learning is a powerful tool that enables learning over large multitudes of agents or domains by enforcing that all agents operate on a shared set of learned features. However, many robotics or controls applications that would benefit from collaboration operate in settings with changing environments and goals, whereas most guarantees for representation learning are stated for static settings. Toward rigorously establishing the benefit of representation learning in dynamic settings, we analyze the regret of multi-task representation learning for linear-quadratic control. This setting introduces unique challenges. Firstly, we must account for and balance the misspecification introduced by an approximate representation. Secondly, we cannot rely on the parameter update schemes of single-task online LQR, for which least-squares often suffices, and must devise a novel scheme to ensure sufficient improvement. We demonstrate that for settings where exploration is benign, the regret of any agent after T timesteps scales with the square root of T/H, where H is the number of agents. In settings with difficult exploration, the regret scales as the square root of the input dimension times the parameter dimension multiplied by T, plus a term which scales with T to the three quarters divided by H to the one fifth. In both cases, by comparing to the minimax single-task regret, we see a benefit of a large number of agents. Notably, in the difficult exploration case, by sharing a representation across tasks, the effective task-specific parameter count can often be small. Lastly, we validate the trends we predict.
more » « less
Free, publicly-accessible full text available April 11, 2026
Regret analysis of multi-task representation learning for linear-quadratic adaptive control

Lee, Bruce D; Toso, Leonardo F; Zhang, Thomas T; Anderson, James; Matni, Nikolai (February 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Free, publicly-accessible full text available February 1, 2026
Applying Machine‐Learning Methods to Laser Acceleration of Protons: Lessons Learned From Synthetic Data

https://doi.org/10.1002/ctpp.202400080

Desai, Ronak; Zhang, Thomas; Felice, John J; Oropeza, Ricky; Smith, Joseph R; Kryshchenko, Alona; Orban, Chris; Dexter, Michael L; Patnaik, Anil K (March 2025, Contributions to Plasma Physics)

In this study, we consider three different machine‐learning methods—a three‐hidden‐layer neural network, support vector regression, and Gaussian process regression—and compare how well they can learn from a synthetic data set for proton acceleration in the Target Normal Sheath Acceleration regime. The synthetic data set was generated from a previously published theoretical model by Fuchs et al. 2005 that we modified. Once trained, these machine‐learning methods can assist with efforts to maximize the peak proton energy, or with the more general problem of configuring the laser system to produce a proton energy spectrum with desired characteristics. In our study, we focus on both the accuracy of the machine‐learning methods and the performance on one GPU including memory consumption. Although it is arguably the least sophisticated machine‐learning model we considered, support vector regression performed very well in our tests.
more » « less
Free, publicly-accessible full text available March 1, 2026
Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data

Zhang, Thomas TCK; Toso, Leonardo Felipe; Anderson, James; Matni, Nikolai (January 2024, ICLR 2024)

Full Text Available
Adversarial Tradeoffs in Robust State Estimation

https://doi.org/10.23919/ACC55779.2023.10156358

Zhang, Thomas T.; Lee, Bruce D.; Hassani, Hamed; Matni, Nikolai (May 2023, IEEE)
Multi-task Imitation Learning for Linear Dynamical Systems

Zhang, Thomas T.; Kang, Katie; Lee, Bruce D.; Tomlin, Claire; Levine, Sergey; Tu, Stephen; Matni, Nikolai (July 2023, L4DC - PMLR)

Full Text Available
Performance-Robustness Tradeoffs in Adversarially Robust Linear-Quadratic Control

https://doi.org/10.1109/CDC51059.2022.9992393

Lee, Bruce D.; Zhang, Thomas T.C.K.; Hassani, Hamed; Matni, Nikolai (December 2022, 2022 IEEE 61st Conference on Decision and Control (CDC))

Full Text Available
TaSIL: Taylor Series Imitation Learning

Pfrommer, Daniel; Zhang, Thomas TCK; Tu, Stephen; Matni, Nikolai (January 2022, Conference on Neural Information Processing Systems)

Full Text Available
Adversarially Robust Stability Certificates can be Sample-Efficient

Zhang, Thomas; Tu, Stephen; Boffi, Nicholas; Slotine, Jean-Jacques; Matni, Nikolai (January 2022, Learning for Dynamics and Control)

Full Text Available

Search for: All records